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ABSTRACT: 

The  decay  constant  and  initial  activity  for  a  single  exponential  decay, 
together  with  a  constant  background  activity,  are  assumed  to  be  the  three 
adjustable  parameters  for  a  least  squares  fit  of  experimental  data.  The 
expected  variances  of  these  three  parameters  are  predicted  mathematically  for 
the  general  case  where  it  is  assumed  that  experimental  decay  counts  with 
inherent  statistical  uncertainty  are  collected  in  successive  equal  time  inter- 
vals of  negligible  uncertainty.  Comparisons  of  a  parameter  uncertainty  with 
respect  to  the  time  interval,  number  of  intervals  and  background  activity  are 
based  on  a  fixed  "confidence  interval"  for  the  parameter.  Assumed  background 
rates  are  expressed  as  fractions  of  the  initial  decay  activity.  The  results 
can  be  siirply  applied  to  arbitrarily  assumed  values  of  decay  constant  and 
initial  activity. 

This  work  was  supported  in  part  by  the  Office  of  Naval  Research  through  the 
Naval  Postgraduate  School  Research  Foundation  Program. 
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INTRODUCTION 

The  experimenter  measuring  a  radioactive  decay  very  often  must  decide 
both  the  sizes  of  the  time  intervals  for  each  group  of  counts  recorded  and 
the  total  number  of  groups  of  counts  to  be  recorded.  Analysis  of  the  data 
then  permits  determination  of  the  decay  activity,  the  decay  constant,  and 
the  background  activity  with  hopefully  sufficient  precision  for  one  or 
more  of  these  quantities.   It  is  hoped  that  this  paper  may  be  useful  in 
making  the  above  decisions  for  the  measurement  of  a  single  decay  plus 
background,  where  the  multi-scalar  mode  of  a  multi-channel  analyser  is  often 
employed,  and  where  the  data  are  least- squares  analysed.   Only  statistical 
counting  errors  are  treated.   It  is  assumed  that  decay  counts  are  recorded 
successively  in  equal  time  intervals  of  negligible  uncertainty.   The  least 
squares  analysis  of  these  data  then  yields  the  calculated  values  of  the 
three  parameters:  decay  constant,  initial  activity,  and  constant  back- 
ground activity.   The  unbiased  estimates  of  the  variances  (standard  devia- 
tions squared)  of  the  above  parameters  are  also  given  by  least  squares 
analysis. 

Prediction  analysis  applied  to  the  type  of  experiment  stated  above, 
yields  the  predicted  values  of  the  three  variances  for  assumed  values  of 
the  three  corresponding  parameters  for  a  given  time  interval  and  given 
number  of  data  points.  The  values  of  these  predicted  variances  are  com- 
parable to  the  average  values  which  would  be  obtained  if  an  actual  ex- 
periment (with  the  above  assumed  values  of  parameters,  time  interval  and 
number  of  points)  were  repeated  a  large  number  of  times. 

The  numerical  results  obtained  are  used  to  make  comparisons  of  the 
predicted  uncertainties  of  any  one  of  the  parameters  over  large  ranges  of 
values  of  time  interval  and  number  of  points.   Assuming  that  approximate 
values  of  the  parameters  are  known  in  advance,  the  results  shown  may  then 
be  used  as  a  guide  for  the  experimenter  to  make  valid,  practical  deci- 
sions regarding  time  interval  and  number  of  points  in  order  to  achieve 
some  particular  desired  accuracy  in  one  or  more  of  the  parameters  calcu- 
lated by  least  squares  analysis. 
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ANALYSIS 

Some  Pertinent  Features  of  the  Least  Squares  Analysis 

The  calculated  total  activity  ri  in  counts  per  unit  time  corres- 
ponding to  what  would  be  registered  by  a  counting  apparatus  can  be 
represented  as, 

n  =  n  e    +  n, 

where  t  is  the  time,  ri  and  X  are  the  initial  activity  and  decay 
constant  of  the  exponential  decay,  and  n,   is  the  constant  background 
activity.   Integration  of  the  above, results  in: 

n  -  (n  A)(l  -  e"Xt)  +  fit 
co  b 

where  n   represents  the  calculated  number  of  counts  corresponding  to 

those  registered  during  the  time  t. 

Now  if  counts  are  registered  during  equal  time  intervals  T   starting 

at  t  -  0,  then  t .  ■  iT    (i  =  0,  1,  Q)^  represents  the  beginning 

time  of  the  i'th  counting  interval,  there  are  a  total  of  M  *  Q  +  1 

counting  intervals  and  thus  M  experimental  points.  Applying  the  above 

we  have 

n.  =  (nQA)  [e'iXT(l  -  e"XT)  +  C^/^Xt]  (1) 

i  =  0,  1,  Q 

where  n.   represents  the  calculated  value  of  the  number  of  counts  in  the 
i'th  counting  interval.   In  later  portions  of  the  paper  the  ratio  of 
n,   to  n   is  expressed  by  the  symbol  e  ,  ie.  e  «  n,  /ri  .   It  will  be 
assumed  that  A.  ,  fi  and  n,   represent  the  calculated  values  of  the  three 
parameters  based  on  a  least  squares  analysis  of  M  experimental  points. 
Thus,  n.   in  Eq.  (1)  is  the  least-squares-calculated  valme  of  counts  for 
the  i'th  time  interval. 

An  actual  measurement  over  the  M  time  intervals  of  width  t  , 
records  N.   counts  in  the  i'th  interval,  subject  to  statistical  fluctua- 
tions.  Thus  the  estimate  of  the  standard  deviation  of  N.  is  given  by 

t 2  3  4  x 

y  N.   .   A  least  squares  analysis  '  '   of  the  M  points  determines  those 
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values  of  the  parameters  A  ,  n   ,  ri   which  minimize  the  sum  of  the  weighted 
squares  of  the  residuals  given  by  S  =  E   (1/N.)(N.  -  n.)2.   The  term 

iJL       JL         _L 
=o 

(N.  -  n.)  is  the  i'th  residual  and  1/N.  is  the  "weight"  of  the  i'th  point. 
If  a  particular  experiment  with  M  points  were  repeated  many  times,  the  mean 
value  of  S  would  be  (M  -  3) .   (M  -  3)  is  the  number  of  "degrees  of  freedom"  - 
in  general  the  number  of  degrees  of  freedom  (denoted  symbolically  by  n  -  p) 
equals  the  number  of  points  minus  the  number  of  parameters. 

It  is  assumed  that  an  iterative  least  squares  analysis  has  been  made 
(usually  by  computer),  whereby  the  function  in  Eq.  (1)  is  linearized  with 
respect  to  the  parameters  A  ,  n  and  xl    for  each  i'th  point.   The  linearization 
involves  the  following  quantities  for  each  point: 

1   3n.         .      1   3n.  1   3n. 

ai  =  Tff,  3A1   '      bi  =  7$ .  351   '     Ci  =  ?N.  3&£  (2) 

I  l   o  lb 

where  each  of  the  above  partial  derivatives  is  evaluated  with  the  final 
(i.e.,  the  last  step  in  the  iterative  calculation)  calculated  values  for 
A  ,  n  and  n^ . 

The  symmetric  three  by  three  "coefficient  matrix"  C  is  constructed  using 
the  terms  in  Eqs.  (2).   The  six  different  elements  of  C  are  given  by: 


C,,  -  £a2  C__  =  Eb?         C__  =  Zc2 

11     i  22     i  33     l 


C._  =  Ea.b.  C00  =  Zb.c.       Con  =  Ec.a. 

12     li  23     ii        31     ii 


(3) 


(The  above  summations  are  from  i=0toi=Q.) 

The  elements  of  this  matrix,  together  with  the  numerical  values  of  the  observed 

counts,  are  used  to  determine  the  calculated  values  of  the  three  parameters. 

Let  s(A)  ,  s  (ri  )  ,  s(n,  )   represent  the  unbiased  estimates  of  the  standard 
o       b 

deviations  of.  A  t   n  and  n,  respectively.   The  corresponding  variances  from  the 

theory  of  least  squares  analysis  are: 

°2M  •  ( rhr)  cii  •  *2«o>  -  (s---r)  c22  •  s2^>  =(Vb)c33 

The  three  matrix  elements  are  the  diagonal  elements  of  the  inverse  coefficient 
matrix,  i.e.,  C  1.      The  term  S/(M  -  3)  which  appears  in  each  of  the  above 
expressions  is  called  by  some  authors  the  "weighted  variance  of  fit".5 


(4) 
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Prediction  of  the  Variances 

The  above  discussion  has  briefly  mentioned  those  features  of  the 
least  squares  analysis  of  actual  data  (as  applied  to  the  particular 
problem  of  single  exponential  decay  plus  constant  background)  which  are 
pertinent  to  the  main  topic  of  this  article,  ie.  the  prediction  of  the 
parameter  uncertainties.  Prediction  of  the  variances  of  the  parameters 
comes  under  the  general  subject  of  Prediction  Analysis  which  is  a  general 

method  for  predicting  the  uncertainties  of  results  that  should  be  ob- 

6 
tained  from  a  proposed  experiment  . 

Referring  to  Eqs.  (4)  we  note  that  each  of  the  variances  has  the 

S/(M  -  3)   factor.  As  mentioned  above,  the  mean  value  of  S  is  M  -  3. 

Thus,  the  mean  value  of  the  entire  factor  (ie.  the  weighted  variance  of 

fit)  is  unity.  This  is  the  average  value  of  the  factor  which  should  be 

obtained  if  the  experiment  were  to  be  repeated  many  times.  Accordingly, 

the  predicted  variances  are  defined  to  be: 

afro  -  e:J    ,       sfeo)  =  t'2l   ,         sfa^  *  £33  (5) 

Some  of  the  notation  of  the  preceding  reference  is  used  here,  where  the 
caret   (a  )  denotes  the  predicted  value  of  a  quantity.  Thus  the  diagonal 
elements  of  the  predicted  inverse  coefficient  matrix  appear  in  the  above 
expressions.  The  elements  of  the  predicted  coefficient  matrix  and  those 
of  the  predicted  inverse  coefficient  matrix  are  formed  as  follows: 

1.  Arbitrary  values  of  X    ,  n  and  n,   are  assumed.  They  will  still 
be  denoted  by  the  above  symbols  in  this  discussion. 

2.  Eq.  (1)  is  used  to  evaluate  the  expressions  in  Eq  (2)  together 

with  the  substitution  of  n.   for  N.   in  Eq.  (2). 

1        1 

3.  The  elements  in  Eq.  (3)  are  now  formed  using  the  predicted 
versions  of  Eq.  (2).  Thus  the  predicted  coefficient  matrix 
can  be  formulated. 

4.  Desired  elements  of  the  predicted  inverse  coefficient  matrix  are 
now  evaluated  from  the  predicted  coefficient  matrix. 

The  results  of  carrying  out  the  above  steps  to  determine  the  predicted 
variances  of  Eq.  (5)  are  expressed  most  simply  by  the  following  summari- 
zation where  first  we  have  for  the  squares  of  the  predicted  fractional 
standard  deviations: 


ih) 


Ltic  • 


«|o'=li|A  ,   %>=ti\B 
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(6a) 


(6b) 


(6c) 


With   P  defined  by  the  equation  P  =  Xt  ;   also   e  =»  fi,  /n  and  Q  =  M  -  1  , 

b   o 

we  have : 


G  -  1, 


raa1  x  2[ab][bc][ca1  -  [bblCacI2-  CcclCabl2 
~  [bb][cc]  -  [bc]^ 


(7a) 


(1  -  e"P)2 


.   2[ab][bc][ca]  -  [cc^ab]2-  [aa1[bcl2 

_bb]  + ■"-: — 2  — 

[cc][aa]  -  [ca] 


(7b) 


;  l/J   (<?V) 


[cc]  + 


2[ab][bc][ca]  -  [aa][bc]2-  [bb][ca]2 


[aa][bb]  -  [ah]' 


(7c) 


and  with  the  use  of  the  two  defining  equations, 


N(i)  =  Pe"P  -  (1  +  iP)(l  -  e"P)  ,   D(i)  =  eP  +  (1  -  e"P)e"lP 


*     Eqs.  (7)  are  evaluated  with: 


[aa]  =  Se'2iPN?i)/D(i)  ,  [bb]  =  Se"2iP/D(i)   ,  [cc]  =  El/D(i) 

[ab]  =  Ee"2iPN(i)/D(i)  ,  [be]  =  Ee"iP/D(i)    ,  [ca]  =  Ze"lPN(i)/D(i) 


(The  above  summations  are  from  i  =  0  to  i  =  Q) 

* 


RESULTS 


A  specific  value  of  P  =  Xt  =  t/(1/X)   denotes  the  magnitude  of  the 

time  interval  measured  in  units  of  mean-lives.   (1/X   is  the  mean-life  in 

units  of  time  corresponding  to  X   ;  the  corresponding  half- life  is!  In2/  X  ) 

i 
From  Eqs.  (6)  and  (7)  wc  note  that  G,  A  and  B  arc  functions  only  of  e  ,  p 

and  M;  thus  when  e  ,  p  and  M   arc  specified  the  quantities   G,  A   and  1$  can 

be  completely  determined.   Exact  numerical  values  for  these  quantities  have 
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been  calculated  for  representative  values  of  e ,  P  and  M  using  the 
IBM  360  computer.   The  above  expressions  for  G,  A  and  B  are  particu- 
larly suited  for  computer  calculation  since  for  given  values  of  €  and  P, 
each  additional  term  in  the  summations  (such  as  [aa],  [ab],  etc.) 
corresponds  effectively  to  a  new  value  of  M.  Accordingly  the  computer 
output  is  programmed  to  render  values  of  G,  A  and  B  for  all  possible 
values  of  M  up  to  the  maximum  value  specified  in  the  input. 

Assuming  that  the  predicted  variances  are  a  valid  measure  of  the  un- 
certainties to  be  expected  in  the  parameters,  a  study  of  G,  A  and  B 
with  respect  to  e ,  P  and  M  can  be  used  to  draw  some  useful  quali- 
tative conclusions  regarding  predicted  accuracy.   And  when  numerical 

values  are  chosen  for  X  and  h   ,  the  predicted  fractional  standard 

o  '     r 

deviations  (and  hence  the  predicted  variances  and  standard  deviations)  may 
be  determined  quantitatively. 

For  example,  with  assumed  values  of  e  =0.1,  P  =  1/200,  calculated 
values  of  G,  A  and  B  are  plotted  versus  MP  on  Fig.  1.   The  numerical 
value  of  MP  denotes  the  overall  measurement  time  in  mean- lives  corres- 
ponding to  M  points.   The  abcissa  on  all  subsequent  Figures  except  Fig.  3 
will  be  MP.   Although  the  values  of  MP  are  discrete,  the  results  are 
plotted  as  continuous  curves.  Note  that  for  given  values  of  X  and  ft   , 
every  different  combination  of  values  of  e,  P,  M  constitutes  a  unique 
experiment.   In  Fig.  2,  G  is  shown  versus  MP  for  e  =  0.1,  0.5  and  1.0 
where  P  =  1/200.   On  the  basis  of  these  results  we  may  say  as  expected,  that 
for  any  given  X  and  n  the  predicted  fractional  standard  deviations  of 
all  the  parameters  decrease  as  M  increases  (ie.  as  total  counting  time 
increases)  and  that  the  greater  the  amount  of  background  compared  to  the 
initial  activity,  the  larger  the  fractional  standard  deviation  for  X. 
Exact  comparisons  of  fractional  standard  deviations  for  different  cases 
may  be  made  with  the  above  results  if  desired. 

For  a  numerical  illustration  we  choose  a  decay  whose  mean- life  is 
1032  hours  (43  days)  and  whose  initial  activity  is  7  2,000  counts /hour  with 
one  tenth  of  that  as  the  background  activity,  ie.  7,200  counts /hour.   For 
a  counting  interval  of  1/200 'th  of  a  mean- life  (5.16  hours)  and  a  total 
counting  time  of  0.3  mean-lives  (12.9  days),  G  =  3.85  x  10  from  Fig.  1  or 
Fig.  2.  Applying  Eq .  (6a)  we  find  that  s|2(X)/X2  =  5.2  x  10~3;  so  S(\)A  = 
7.2  x  10"2  and  s(X)  =7.2  10_2/1032  =  6.8  x  10"5  hr."1.   Thus,  7.2%  fractional 
standard  deviation  is  predicted  for  the  determination  of  X  in  the  above 
experiment . 


Let  us  again  consider  the  least  squares  analysis  of  data  for  a  given 
experiment.   Repetition  of  this  experiment  would  yield  a  distribution  in 
the  values  of  each  of  the  calculated  parameters.   If  these  were  normal  dis- 
tributions then  one  could  say  for  one  of  the  parameters  (we  shall  use   X   as 
an  example)  that  68.37.  of  the  calculated  values  of  X  would  fall  within 
+  s(X)   of  the  true  value  of  X  ,  where   s(X)   is  the  unbiased  estimate 
of  the  standard  deviation  of  X   obtained  from  one  of  the  sets  of  data. 
In  general  for  n  -  p  degrees  of  freedom,  the  parameters  have  a  "t— distri- 
bution" rather  than  a  normal  distribution.   For  n  -  p  degrees  of  freedom, 

values  of  t  ,  __     arc  those  values  of  t  for  which  there  is  a  68.  '17, 
. 159, n-p 

probability  that  calculated  values  of   X   will  fall  within  +  s(X)t  icri 

r         J  —  .159, n-p 

of  the  true  value  of  X  .   The  total  interval  mentioned  above  is  referred 

to  as  the  68.37,  confidence  interval.   (Note  that  .159  =  (1  -  .683)/2  .) 

The  larger  n-p  becomes  the  more  closely  the   t-distribution  approaches 

the  normal  distribution.   For  example   t  1C_  orv  =  1.03  and   t  .  __    =1. 

r     .159,20  .159, OT 

It  will  be  assumed  then  that  a  comparison  of  ^(X)t  ,cn      (and  similarly 

. 159 ,n-p 

for  the  other  parameters)  will  give  a  valid  comparison  of  the  predicted 
uncertainties  of  the  parameters  for  the  assumed  values  of  e  ,  M  ,  and  P. 
For  the  results  shown  in  Figs.  1  and  2,  comparisons  without  the  t  values 
are  valid  since  all  the  corresponding  values  of  n-p  are  20  or  greater. 
For  the  purpose  of  valid  comparison  we  now  replace  the  predicted  standard 
deviations  in  Eqs.  (6)  with  the  product  of  t  1C.Q  and  the  standard  deviations 

(the  product  is  half  the  68.37>  confidence  interval)  resulting  in: 


s(X) 


-  Gt2 

1" J  Gt.  159  ' 


.159 


£(n.) 
-3— b 

1% 


The  t 


.159 


(8a)  (8b)  (8c) 

values  of  Fig.  3  arc  based  on  a  linear  interpolation  from  tables 


(8a,  b,  c) 

7,8 


Only  uncertainties  in  X  are  considered  in  the  results  that  follow. 

2 
Figs.  4  and  5  are  plots  of  Gt  ._„  versus  MP  for  representative  values  of 

e  ,  P  and  M. 

2 
Gt   ,.   versus  MP  is  plotted  on  Fig.  4  for  different  values  of  P  when 

€  =  0.01.   From  inspection  of  these  curves  it  becomes  apparent  l hat  (he  ex- 
perimenter may  achieve  good  precision  for   X   by  choosing  the  counting 
interval  to  be  about  %  of  a  mean-life  and  by  collecting  data  for  a  total 
of  about  10  mean-lives.   The  cost  in  additional  effort  of  using  a  counting 
interval  as  small  as  1/10  of  a  mean- life  would  not  produce  a  worthwhile 
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improvement  in  precision,  nor  would  counting  for  any  longer  total  time 

appreciably  improve  the  precision. 

2 
Gt  ,--  versus  MP  is  plotted  on  Fig.  5  where  P  ■  %,  over  a  range  of 

values  of  assumed  background  relative  to  h   .  The  conclusion  reached  in 
the  above  paragraph  regarding  total  time  of  counting  still  appears  to  be 
valid  provided  the  background  is  not  too  high.  Even  for  e  ■  0.2,  the 
fractional  standard  deviation  in  X  for  10  mean- lives  total  counting  time 
is  only  10ft  higher  than  for  counting  a  total  of  20  mean- lives.  The  dotted 
curve  included  in  Fig.  5  is  of  interest.  This  was  obtained  from  an  analysis 
of  a  different  case  where  no  background  activity  is  assumed.  For  this  case 
Eq.  (1)  would  not  have  the  n,   term  at  all  and  only  the  parameters  X  and 
n   would  be  calculated.  The  condition  e  =  0  defines  a  particular  value 
of  n,   for  the  three-parameter  case;   this  results  in  larger  uncertainties  . 
for  X  compared  to  the  two-parameter  case,  when  the  total  measuring  time  is 
less  than  12  mean- lives. 

The  computer  program  devised  for  the  case  treated  in  this  paper  has 
been  applied  to  a  large  range  of  values  of  e  ,  P  and  M  of  practical 
interest.  Representative  values  of  e  ,  P  and  M  were  chosen  for  the 
results  shown  in  Figs.  I,a2v  4,  and  5.   It  is  seen  from  Figs.  £  and  5 
that  if  desired,  the  limiting  values  of  the  standard  deviations  (or  half- 
confidence  intervals)  can  be  approached  by  choosing  M  sufficiently  large. 
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FIG.  I 


MP  (MEAN-LIVES) 
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FIG.  2 


MP  (MEAN-LIVES) 
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